Reinforcement with iterative punishment
نویسندگان
چکیده
We consider the efficacy of various forms reinforcement learning with punishment in evolving linguistic conventions context Lewis-Skyrms signalling games. show that strategy iterative is highly effective at optimal even complex It also robust and can be easily extended to a self-tuning variety learning. briefly discuss some virtues how it may related nature.
منابع مشابه
Striatal mechanisms underlying movement, reinforcement, and punishment.
Direct and indirect pathway striatal neurons are known to exert opposing control over motor output. In this review, we discuss a hypothetical extension of this framework, in which direct pathway striatal neurons also mediate reinforcement and reward, and indirect pathway neurons mediate punishment and aversion.
متن کاملKreitzer Reinforcement , and Punishment Striatal Mechanisms Underlying Movement
Physiol. Soc.. ESSN: 1548-9221. Visit our website at http://www.the-aps.org/. American Physiological Society, 9650 Rockville Pike, Bethesda MD 20814-3991. ©2012 Int. Union Physiol. Sci./Am. the physiological developments. It is published bimonthly in February, April, June, August, October, and December by (formerly published as News in Physiological Science) publishes brief review articles on m...
متن کاملAsymmetry of reinforcement and punishment in human choice.
The hypothesis that a penny lost is valued more highly than a penny earned was tested in human choice. Five participants clicked a computer mouse under concurrent variable-interval schedules of monetary reinforcement. In the no-punishment condition, the schedules arranged monetary gain. In the punishment conditions, a schedule of monetary loss was superimposed on one response alternative. Devia...
متن کاملAn Iterative Reinforcement Approach for Fine-Grained Opinion Mining
With the in-depth study of sentiment analysis research, finer-grained opinion mining, which aims to detect opinions on different review features as opposed to the whole review level, has been receiving more and more attention in the sentiment analysis research community recently. Most of existing approaches rely mainly on the template extraction to identify the explicit relatedness between prod...
متن کاملFlexible theft and resolute punishment: Evolutionary dynamics of social behavior among reinforcement-learning agents
Existing models of the evolution of social behavior typically involve innate strategies such as tit-for-tat. Yet, both behavioral and neural evidence indicates a substantial role for learned social behavior. We explore the evolutionary dynamics of two simple social behaviors among learning agents: Theft and punishment. In our simulation, agents employ Q-learning, a common reinforcement learning...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Experimental and Theoretical Artificial Intelligence
سال: 2022
ISSN: ['1362-3079', '0952-813X']
DOI: https://doi.org/10.1080/0952813x.2022.2153272